Robust speech recognition by integrating speech separation and hypothesis testing

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CASA based speech separation for robust speech recognition

This paper introduces a speech separation system as a front-end processing step for automatic speech recognition (ASR). It employs computational auditory scene analysis (CASA) to separate the target speech from the interference speech. Specifically, the mixed speech is preprocessed based on auditory peripheral model. Then a pitch tracking is conducted and the dominant pitch is used as a main cu...

متن کامل

Improving the performance of MFCC for Persian robust speech recognition

The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...

متن کامل

\eigenlips" for Robust Speech Recognition \eigenlips" for Robust Speech Recognition

In this study we improve the performance of a hybrid connectionist speech recognition system by incorporating visual information about the corresponding lip movements. Speciically, we investigate the beneets of adding visual features in the presence of additive noise and crosstalk (cocktail party eeect). Our study extends previous experiments by using a new visual front end, and an alternative ...

متن کامل

improving the performance of mfcc for persian robust speech recognition

the mel frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. in this paper to achieve a satisfactorily performance in automatic speech recognition (asr) applications we introduce a noise robust new set of mfcc vector estimated through following steps. first, spectral mean normalization is a pre-processing which applies to t...

متن کامل

Japanese speech databases for robust speech recognition

At ATR, a next-generation speech translation system is under development towards natural trans-language communication. To cope with the various requirements to speech recognition technology for the new system, further research efforts should emphasize the robustness for large vocabulary, speaking variations often found in fast spontaneous speech and speaker variances. These are key problems to ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Speech Communication

سال: 2010

ISSN: 0167-6393

DOI: 10.1016/j.specom.2009.08.008